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ABSTRACT 

In Lima et al. (2008) we presented a new method for estimating the redshift 
distribution, N{z), of a photometric galaxy sample, using photometric observables 
and weighted sampling from a spectroscopic subsamplc of the data. In this paper, 
we extend this method and explore various applications of it, using both simulations 
of and real data from the SDSS. In addition to estimating the redshift distribution 
for an entire sample, the weighting method enables accurate estimates of the red- 
shift probability distribution, p{z), for each galaxy in a photometric sample. Use of 
p{z) in cosmological analyses can substantially reduce biases associated with tradi- 
tional photometric redshifts, in which a single redshift estimate is associated with 
each galaxy. The weighting procedure also naturally indicates which galaxies in the 
photometric sample are expected to have accurate redshift estimates, namely those 
that lie in regions of photometric-observable space that are well sampled by the spec- 
troscopic subsample. In addition to providing a method that has some advantages over 
standard photo-z estimates, the weights method can also be used in conjunction with 
photo-z estimates, e.g., by providing improved estimation of N{z) via deconvolution 
of -/V(2;phot) and improved estimates of photo- z scatter and bias. We present a publicly 
available p(z) catalog for ~ 78 million SDSS DR7 galaxies. 

Key v^rords: distance scale - galaxies: distances and redshifts - galaxies: statistics - 
large scale structure of Universe 



1 INTRODUCTION 

Optical and near-infrared wide-area surveys planned for the 
next decade will increase the size of photometric galaxy sam- 
ples by an order of magnitude, delivering measurements of 
billions of galaxies. Much of the utility of these samples for 
astronomical and cosmological studies will rest on knowledge 
of the redshift distributions of the galaxies they contain. For 
example, surveys aimed at probing dark energy via clusters, 
weak lensing, and baryon acoustic oscillations (BAO) will 
rely on the ability to coarsely bin galajcies by redshift, en- 
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abling approximate distance-redshift measurements as well 
as study of the growth of density perturbations. The power 
of these surveys to constrain cosmological parameters will be 
limited in part by the accuracy with which the galaxy red- 
shift distributions can be determined (Huterer et al. 2004, 
2006; Zhan & Knox 2006; Zhan 2006; Ma et al. 2006; Lima 
& Hu 2007). 

Photometric redshifts — approximate estimates of 
galaxy redshifts based on their broad-band photometric 
observables, e.g., magnitudes or colors — offer one set of 
techniques for approaching this problem. However, photo-z 
estimators are typically biased to some degree, and they 
can suffer from catastrophic failures in certain regimes. 
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These problems motivate the development of potentially 
more robust methods. 

In Lima et al. (2008) we presented a new, empirical 
technique aimed not at estimating individual galaxy red- 
shifts but instead at estimating the redshift distribution, 
N{z), for an entire photometric galaxy sample or suitably 
selected subsample. The method is based upon matching the 
distributions of photometric observables (e.g., magnitudes, 
colors, etc) of a spectroscopic subsample to those of the pho- 
tometric sample. The method assigns weights to galaxies in 
the spectroscopic subsample (hereafter denoted the training 
set, in analogy with machine- learning methods of photo-z 
estimation), such that the weighted distributions of observ- 
ables for these galaxies match those of the photometric sam- 
ple. The weight for each training-set galaxy is computed by 
comparing the local "density" of training-set galaxies in the 
multi-dimensional space of photometric observables to the 
density of the photometric sample in the same region. We es- 
timate the densities using a nearest-neighbor approach that 
ensures that the density estimates are both local and sta- 
ble in sparsely occupied regions of the space. The use of the 
nearest neighbors ensures optimal binning of the data, which 
minimizes the requisite size of the spectroscopic subsample. 
After the training-set galaxy weights are derived, we sum 
them in redshift bins to estimate the redshift distribution 
for the photometric sample. 

As Lima et al. (2008) show, this weighting method pro- 
vides a precise and nearly unbiased estimate of the underly- 
ing redshift distribution for a photometric sample without 
recourse to photo-z estimates for individual galaxies. More- 
over, the spectroscopic training set does not have to be rep- 
resentative of the photometric sample, in its distributions 
of magnitudes, colors, or redshift, for the method to work. 
(By contrast, the performance of training-set-based photo-z 
estimators generally degrades as the training set becomes 
less representative of the photometric sample.) The only re- 
quirement is that the spectroscopic training set covers, even 
sparsely, the range of photometric observables spanned by 
the photometric sample. The weighting method can be ap- 
plied to different combinations of photometric observables 
that correlate with redshift — here, we confine our analysis 
to magnitudes and colors. 

In this paper we present additional applications of the 
weighting method, test its performance on simulated data 
sets, and show results of those applications using data from 
the SDSS. The applications of the weighting method nat- 
urally fall into two categories, those that enhance photo-z 
estimators and those that (potentially) replace photo-z es- 
timation. In the first category, we show that the weighting 
method can be used to improve estimates of the scatter and 
bias of training-set-based photo-z estimates as functions of 
(true) spectroscopic redshift. Knowledge of such errors are 
very important, since uncertainties in photo-z bias and scat- 
ter are nuisance parameters that significantly degrade the 
power of cosmological probes (eg. Huterer et al. 2004; Ma 
et al. 2006; Lima & Hu 2007). We also show that the weights 
can be used to obtain improved estimates of the error distri- 
bution of the photo-^'s, P(2phot|zspec), and thereby improve 
the deconvolution procedure used to infer the underlying 
redshift distribution, N{z), from the distribution of photo- 
n's (Padmanabhan et al. 2005). 

In the second category of applications, we consider the 



weighting technique on its own, independently of 'tradi- 
tional' photo-z estimates. The accuracy of the weighting 
method in directly reconstructing N{z) is affected by pho- 
tometric errors and by sparse or incomplete coverage by the 
training set of the space of photometric observables spanned 
by the photometric data. We develop and test a bootstrap 
technique to estimate random errors in the weighted N{z) 
estimate and present a technique for detecting systematic 
errors in it as well. We also discuss the effects of training-set 
non-representativeness on the N{z) estimate. Perhaps most 
importantly, we show that the weighting procedure can be 
used to estimate not only the redshift distribution for the 
(entire) photometric sample, N{z), but also a redshift prob- 
ability distribution, p{z), for each galaxy in the photometric 
sample. Such a distribution contains much more information 
than a discrete photo-z estimate, Zphot. Use of p{z) instead 
of Zphot in cosmological analyses can potentially greatly re- 
duce the biases arising from photo-z's. 

The paper is organized as follows. In §2 we review and 
extend the weighting method for estimating the redshift dis- 
tribution and the redshift probability distribution, focus- 
ing in particular on sources and estimates of errors in the 
method. In §3 we describe the actual and simulated SDSS 
galaxy catalogs that we use to test the weighting method and 
its alternatives. We demonstrate how the weighting method 
improves upon photometric-redshift estimates in the mock 
catalog in §4, and we demonstrate its effectiveness in esti- 
mating N{z), in comparison with photo- z-based methods, in 
§5. We apply the new methods to the real SDSS DR6 in §6. 
We present our conclusions in §7 and include some technical 
details of the analysis in the Appendices. 



2 THE WEIGHTING METHOD 

In this section, we briefly review and extend the weight- 
ing method introduced in Lima et al. (2008). We define the 
weight, w, of a galaxy in the spectroscopic training set as 
the normalized ratio of the density of galaxies in the photo- 
metric sample to the density of training-set galaxies around 
the given galaxy. These densities are calculated in a local 
neighborhood in the space of photometric observables, e.g., 
multi-band magnitudes. More formally, given a training-set 
galaxy, we define its weight by 



— AT IT ' ^ ' 

JVP.tot pT 

where iVp^tot is the total number of galaxies in the photo- 
metric sample, and pp and pT are the local number densities 
in the space of observables for the photometric and training 
sets. 



where A'pjt) is the number of photometric (training) set 
galaxies within volume Vp(t). 

We adopt a nearest-neighbor approach to estimating the 
density of galaxies in magnitude space, because it enables 
control of statistical errors (shot noise) while also ensuring 
adequate "locality" of the volume in magnitude space. We 
define the distance d^p in magnitude space between the a*'^ 
and galaxies in a (photometric or spectroscopic) sample 
using a Euclidean metric. 
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(do/s)^ = (m„ - m^)^ = ^(mj^ - m^)^ , (3) 

a=l 

where Nm. denotes the number of magnitudes (i.e., different 
passbands) measured for each galaxy. We use this distance 
to find the set of nearest neighbors to the a"^ object, i.e., 
the set of galaxies with the smallest dap- For a fixed num- 
ber of nearest neighbors A^nei , if we order the neighbors by 
their distance from the a**" galaxy, then we can define the 
hypervolume in terms of the distance from galaxy a to the 
Nl^i nearest neighbor, indexed by 7, i.e., Vm = ((^07)'^'". 

Estimating the local density in the spectroscopic train- 
ing set using a fixed value for A''(mQ)T = Nnd ensures that 
the density estimate is positive-definite and that the result- 
ing weight is well defined. To estimate the corresponding 
density in the photometric sample, we simply count the 
number of galaxies in the photometric sample, A^(mQ)p, 
that occupy the same hypervolume Vm around the point 
nia. Since the densities are estimated in the spectroscopic 
and photometric sets using the same hypervolume, the ratio 
of the densities in Eqn. 1 is simply the ratio of the cor- 
responding numbers of objects within the volume, and the 
weight for the a"^ training-set galaxy is therefore given by 

_ 1 iV(m^)p , . 

" iVp.tot iV(m„)T ■ ^ ' 

Nnci can be chosen to balance locality, which favors small 
Vm, against statistical errors, which favor large Nnci- 

2.1 Weights and the Redshift Distribution N{z)p 

As shown in Lima et al. (2008), by construction the weighted 
spectroscopic training set has essentially identical distribu- 
tions of multi-band magnitudes and colors as the photomet- 
ric sample from which it is drawn, even though the spectro- 
scopic set is in general not representative of the photometric 
sample. The weighting procedure in effect corrects for that 
non-representativeness, provided the training set adequately 
spans the range of the photometric-observable space covered 
by the photometric sample. Since the weighted training set 
has identical distributions of photometric observables as the 
photometric sample, it is reasonable to assume that the for- 
mer also provides an accurate estimate of the binned redshift 
distribution of the photometric sample, 

N{z)„ei = N{zi <Z< Z2)p = ^ wpN{zi <Zp < Z2)t,(5) 

13^1 

where the weighted sum is over all galaxies in the train- 
ing set. Lima et al. (2008) show that this indeed provides 
a nearly unbiased estimate of the redshift distribution of 
the photometric sample, A^(2:)p, under suitable conditions. 
Examples of this application will be discussed in §5.1. 

2.2 Weights and the Redshift Probability 
Distribution p{z) 

Although knowledge of the redshift distribution for a pho- 
tometric sample, A*'(2:)p, is sufficient for many applications. 



there are of course instances in which one would like red- 
shift information about individual galaxies in the sample. 
As noted in the Introduction, photo-z estimators provide 
one approach to this problem. However, photo-z estimates 
are limited by the fundamental assumption that there is a 
functional relationship between the photometric observables 
and redshift. In fact, galaxies occupying a small cell in the 
space of photometric observables will have a range of red- 
shifts. One can therefore associate that cell with a redshift 
probability distribution function (PDF), p(z|observables). 
The shape of the PDF is determined by the choice of ob- 
servables, the size of the cell, the photometric errors, and 
the range of spectral energy distributions of the galaxies. If 
the PDF is narrowly peaked, photo-2 estimates can be both 
precise (small scatter) and accurate (small bias). However, if 
the distribution is broad, skewed, or multiply peaked, then 
photo-2: estimates will suffer large scatter, bias, and poten- 
tially catastrophic failures. The ubiquitous positive bias of 
photo- jz estimates for low-redshift galaxies and negative bias 
for high-redshift galaxies are consequences of this fundamen- 
tal assumption. Low- and high-redshift objects can in some 
cases occupy the same cell of magnitude space, but photo-z 
estimators will assign them all essentially the same redshift. 

To overcome these problems and avoid the biases in- 
trinsic to photo-2 estimates, it is preferable to use the full 
redshift PDF for the galaxies in a small cell in the space of 
photometric observables, p{z) = p(2|observables). This PDF 
encodes all the information available about the redshift of an 
individual galaxy in a photometric sample. One can choose 
to extract a single redshift estimate from the PDF, e.g., its 
mean, median, or mode, but often that is not necessary in 
applications. 

The weighting method described above can be straight- 
forwardly applied to estimate p{z) using a spectroscopic 
training set. The estimator p{z) for a galaxy in the photo- 
metric sample is given by the weighted redshift distribution 
of its A'^nei ucarcst neighbors in the training set, using the 
metric of Eqn. 3, 

JVnoi 

Piz) = ^ wisSiz - zp) , (6) 

where, as before, A^noi can be determined from simulations 
by minimizing the sum of the shot-noise and "non-locality" 
errors. In practice, we estimate p{z) in redshift bins. This es- 
timate for p{z) was used in a study of galaxy-galaxy lensing 
by Mandelbaum et al. (2007) and was shown to yield signif- 
icantly smaller lensing calibration bias than use of photo-2 
estimates. 

We can also construct a new estimator for N[z)p by 
summing the p[z) distributions for all galaxies in the pho- 
tometric sample, 

iVp.tot 

iV(2)p = ^ p.(2) . (7) 
i = l 

This estimator is similar but not identical to that of Eqn. 
5. We will see in §5.1.3 that these two are comparable in 
recovering the true redshift distribution of a photometric 
sample. 
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2.3 Sources of Errors in the Weighting Method 

The errors arising in the weights method can be considered 
the errors in estimating p(z;|observables) for a galaxy in the 
photometric sample from the information in the training set. 
Any differential selection effect between the spectroscopic 
and photometric samples will lead to errors in 'p{z). There 
are several kinds of selection effects: (1) statistical effects, (2) 
large-scale structure (LSS), (3) spectroscopic failures in the 
training set, (4) survey selection in the photometric observ- 
ables, (5) survey selection in non-photometric observables, 
and (6) non-locality of the weights. 

Statistical errors arise because the training set is just 
a subsample of the photometric survey and is subject to 
statistical fluctuations. These fluctuations can be significant 
in regions of magnitude space where the training set is very 
sparse. In such regions, the shot-noise errors in p{z) will 
either be large or else the nearest-neighbor volume must be 
made large, leading to increased non-locality (see below). 
Statistical errors can be estimated by bootstrap resampling 
the training and photometric sets. If the magnitude errors 
are well known, one can further Monte Carlo resample the 
magnitudes. We present results of bootstrap error estimation 
in §5.1.4. 

Errors due to LSS can be significant if certain regions 
of the space of photometric observables are only represented 
in the training set by a spectroscopic survey that covers a 
small solid angle, in which one or a few large structures dom- 
inate. In this case, p(z|observables) for the training set will 
comprise one or a few redshift spikes rather than a smooth 
distribution. If these effects occur in regions of magnitude 
space where the true redshift PDF is broad or multiply 
peaked, they can potentially cause systematic errors in the 
estimates of p{z) or N[z) for the photometric sample. The 
resulting errors may be large if the linear size of the training- 
set volume is not large compared to the galaxy clustering 
correlation length. The errors from LSS can in principle be 
estimated by constructing mock training-set volumes using 
N-body simulations of structure formation. 

Spectroscopic failures, i.e., targeted objects in the train- 
ing set for which redshifts could not be obtained, can also 
lead to systematic errors in p(z) if the failures happen sys- 
tematically, for instance, if they occur preferentially for a 
particular galaxy spectral type and if that type has a dif- 
ferent redshift PDF from other galaxy types in the same 
region of magnitude space. Since such spectroscopic fail- 
ures will tend to occur in specific and identifiable regions 
of magnitude space, however, one can at minimum excise or 
down-weight those regions in estimating quantities for the 
photometric sample (see §2.4), at the cost of incompleteness. 

The severity of these systematic errors is regulated by 
the width of the redshift PDF. In the limit of a large number 
of photometric observables with very small measurement er- 
rors and a large spectroscopic training set, the redshift PDF 
in a small cell in magnitude space approaches a delta func- 
tion. In this regime, the effects of LSS and of spectroscopic 
failures would be simply to increase the statistical errors in 
certain regions of observable space, an effect accounted for in 
the bootstrap error estimate. As one moves away from this 
ideal limit, the systematic errors grow, in the sense that one 
can no longer reliably estimate p(z|observables) for a galaxy 
in the photometric sample from its training-set neighbors. 



That effect is not captured by the bootstrap and must be 
estimated by other means, e.g., using simulations. The mock 
SDSS DR6 catalog we have constructed for this paper (see 
§3.2) does not simulate LSS or spectroscopic failures; we 
plan to study such effects in the future. Some of the surveys 
that comprise the training set for the real DR6 data are in- 
dividually affected by LSS effects. Having a combination of 
them helps to alleviate the problem, though more testing is 
required to quantify the possible systematics. 

LSS and spectroscopic failures lead to unavoidable dif- 
ferences in the selection functions for the photometric and 
spectroscopic samples. In addition, there are differential se- 
lection effects that are built in by those designing the spec- 
troscopic survey. For example, one typically makes magni- 
tude and color cuts in selecting spectroscopic targets from 
a photometric sample. In this case, where the selection is 
made explicitly in the photometric observables, there will 
be regions of observable space where the weights cannot be 
used to reliably estimate redshift distributions. Again, such 
regions are known from the target selection cuts and can 
be safely excised from the photometric sample (see §2.4). 
If, on the other hand, there are differences in spectroscopic 
and photometric selection based on non-photometric observ- 
ables, then systematic errors in ^{z) can occur. 

A variant of this problem arises when the training set 
is selected using photometric observables that are different 
from the ones measured in the photometric sample. For ex- 
ample, for the SDSS DR6 photometric catalog, the spectro- 
scopic target selection for the DEEP2 sample in the train- 
ing set used a different magnitude system (coming from dif- 
ferent photometric samples) from the SDSS. Similarly, the 
selection of the 2SLAQ spectroscopic catalog made use of 
photometric observables that were not used in the photo-z 
estimation. Whether such cuts will cause systematic errors 
depends on how well the selection in those systems can be 
approximated using the SDSS ugriz filters. 

Finally, the non-locality of the weights solution is a 
source of systematic error. Here, non-locality refers to the 
fact that, in the nearest-neighbor approach, we are using in- 
formation from a finite volume to estimate the density at 
a point in observable space, and the density varies over the 
space. This procedure corresponds to applying a smooth- 
ing kernel to the density field. Non-locality becomes a prob- 
lem if the volume occupied by the neighbors (or the scale 
of the smoothing kernel) becomes comparable to or larger 
than the scale over which the density changes appreciably. 
In this limit, the shape of the volume used to select the near- 
est neighbors may be important. Non- locality errors are re- 
duced by choosing a smaller neighbor volume for the density 
estimate, but at the cost of increasing the shot-noise errors. 
Ultimately, the combined errors can be reduced by increas- 
ing the density of the training set in a particular region of 
observable space, i.e., by measuring more spectra. 

2.4 Selecting the "Recoverable" Part of a 
Photometric Sample 

One of the necessary conditions for the weights procedure to 
work is that the spectroscopic training set covers the same 
region of photometric observables as the photometric sam- 
ple. That is, the weights can only recover the redshift PDF 
of a galaxy in the photometric sample if it lies in the region 
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of intersection of the redshift-observables hypersurfaces of 
the training and photometric sets. Defining this region of 
intersection is not always trivial, especially given the high 
number of dimensions that may be involved. To do so, we 
count how many times a galaxy in the photometric sample is 
used in the weights calculation for all members of the train- 
ing set. By definition, photometric galaxies that are never 
counted in the weights procedure are not in the region of in- 
tersection, hence the redshift distribution of those galaxies 
will not be accurately recovered by the weighting procedure. 
We make use of this criterion below. If one does not require 
the photometric sample to be complete, one can choose to 
excise such galaxies from consideration. Using several real 
and mock catalogs, we have found empirically that using 
~ 5 nearest neighbors in the weights calculation is optimal 
for determining the intersection region for the mock catalog. 

As examples, consider the mock and real SDSS DR6 
catalogs of §3. From Figs, lb, 2b, and 3b, one might expect 
that the combined training set covers the same region of 
observables as the photometric sample. However, using the 
definition of the previous paragraph, more than ~ 43% of 
the mock photometric-sample galaxies are not used in the 
weights calculation, i.e., they are not well represented in 
the training set. Fortunately, in the real SDSS DR6 catalog, 
by the same criterion we find that ~ 98% of photometric- 
sample galaxies with r < 22 are well represented in the 
training set. It is important to apply such a recoverability 
test whenever a training-set method is used. 



3 CATALOGS 

To test the performance of the weighting method and com- 
pare it with standard photo-z estimates, we employ two 
kinds of catalogs. The first is drawn from the SDSS DR6 
(Adelman-McCarthy et al. 2007) photometric sample and 
various spectroscopic subsamples of it and allows us to dis- 
play results of the weighting method on real data. The sec- 
ond is a mock catalog constructed to have properties similar 
to the SDSS DR6 photometric and spectroscopic samples. 
The goal of the mock catalog is not to precisely reproduce all 
features of the SDSS catalog but to have a sample with re- 
alistic spectroscopic and photometric features and for which 
we have ground truth (i.e., redshifts and galaxy types) for 
all galaxies. In this section, we describe the relevant fea- 
tures of the real and mock catalogs, relegating the details to 
Appendix A. 

3.1 SDSS DR6 Data 

The SDSS DR6 photometric and spectroscopic data samples 
are drawn from those used by Oyaizu et al. (2008) to produce 
a neural network photo-z catalog. 

3.1.1 Photometric Sample 

We use a random 1% subset of the galaxies in the SDSS 
DR6 Photoz2 catalog described in Oyaizu et al. (2008) as 
our photometric sample. This subset contains approximately 
769,582 galaxies with r < 22. The catalog is approximately 
flux-limited at this magnitude limit. For details of the parent 
sample, see Appendix A and Oyaizu et al. (2008). The r 



magnitude, g — r, and r — i color distributions are shown in 
the bottom right panel of Fig. la and the bottom panels of 
Fig. 2a. 

3.1.2 Spectroscopic Training Set 

The spectroscopic training sample we use for SDSS DR6 is 
drawn from a number of spectroscopic galaxy catalogs that 
overlap with SDSS DR6 imaging. We impose a magnitude 
limit of r < 23 on the spectroscopic samples as well as addi- 
tional cuts based on the quality of the spectroscopic redshifts 
reported by the different surveys (see Appendix A). The 
SDSS spectroscopic sample provides 531, 594 redshifts, prin- 
cipally from the MAIN and Luminous Red Galaxy (LRG) 
samples. The remaining redshifts are: 20, 381 from the Cana- 
dian Network for Observational Cosmology (CNOC) Field 
Galaxy Survey (CNOC2; Yee et al. 2000), 1,531 from the 
Canada-France Redshift Survey (CFRS; Lilly et al. 1995), 
11,040 from the Deep Extragalactic Evolutionary Probe 
(DEEP; Davis et al. 2001) and DEEP2 (Weiner et al. 2005), 
654 from the Team Keck Redshift Survey (TKRS; Wirth 
et al. 2004), and 52,762 LRGs from the 2dF-SDSS LRG 
and QSO Survey (2SLAQ; Cannon et al. 2006). 

The r-magnitude and color {g — r and r — i) distributions 
for the spectroscopic samples are shown in Figures la and 
2a. Although the magnitude and color distributions of the 
combined spectroscopic sample are not identical to those 
of the photometric sample, the spectroscopic sample does 
span the ranges of apparent magnitude and colors of the 
photometric sample. Fig. 3a gives the spectroscopic redshift 
distribution for the combined spectroscopic sample. 

3.2 SDSS DR6: Mock Catalog 

Using spectral template libraries and observational data on 
the redshift-dependent luminosity functions of galaxies of 
different types, we have constructed mock photometric and 
spectroscopic samples that reproduce the main features of 
the real SDSS DR6 samples. We describe these briefly below. 

3.2.1 Mock Photometric Sample 

The simulated SDSS catalog contains lO'^ galaxies with red- 
shift z < 2.0 and magnitude 14 < r < 22. We use the 
If _mock_schechter code from the kcorrect package (Blan- 
ton et al. 2003) to generate redshift, type, and i- magnitude 
relations. The inputs to the code are the redshift range, 
Schechter luminosity function parameters, and the ranges of 
absolute and apparent r-magnitudes. The code outputs a list 
of redshifts and apparent r-magnitudes. We set the range of 
absolute i-band magnitudes to (—24, —14). Using data from 
the VVDS survey, Zucca et al. (2006) estimated galaxy lu- 
minosity functions and Schechter-function flts thereto for 
different galaxy types in redshift bins of size Az = 0.2 from 
Zmin = 0.2 to Zmax = 1-5. We flt simple polynomial func- 
tions to the Schechter parameters of Zucca et al. (2006) to 
derive a continuous relationship between the Schechter pa- 
rameters M* , a, (j)* , redshift z, and galaxy type T, using the 
centroid of each redshift bin for the fit. To regularize the fits, 
we visually extrapolated the results of Zucca et al. (2006) to 
the z = (0, 0.2) bin and, where needed (for certain galaxy 
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Figure 1. Normalized r magnitude distributions for the catalogs 
comprising the real SDSS DR6 (top; figure a) and the mock SDSS 
DR6 {bottom: figure b) catalogs. In each figure, the top four pan- 
els indicate the distributions for the different spectroscopic sub- 
samples (see text), bottom left panels indicate flux distributions 
for the combined spectroscopic samples, and bottom right pan- 
els distributions for the photometric samples. In each panel, Ntot 
denotes the total number of galaxy measurements used in each 
sample. 
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Figure 2. Distributions of g — r and r — i colors for the catalogs 
comprising spectroscopic training and photometric sets for the 
real SDSS DR6 {top: figure a) and the mock SDSS DR6 {bottom: 
figure h). Top rows give distributions for the SDSS spectroscopic 
sample, middle rows the distributions for the other spectroscopic 
samples, bottom rows the distributions for the photometric sam- 
ples. The real and mock SDSS spectroscopic color distributions 
differ primarily because the latter does not include LRGs. 



types), for the (1.2, 1.5) bin. The detailed fits are given in 
Appendix B. 

Galaxy colors are generated using the four Coleman, 
Wu, & Weedman spectral templates (Coleman et al. 1980) — 
E, Sbc, Scd, Im — extended to UV and NIR wavelengths us- 
ing synthetic templates from Bruzual A. & Chariot (1993). 
These templates are mapped to galaxy SED type T (used by 



Zucca et al. (2006)) as (E, Sbc, Scd, Im) T = (1, 2, 3, 4). 
To improve the sampling and coverage of color space, we 
have created additional templates by interpolating between 
adjacent templates. The redshift, r-magnitude, and type re- 
lations are first generated without photometric errors; errors 
are then added to produce observed magnitudes. Magnitude 
errors are modeled as sky-background dominated errors ap- 
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Figure 3. {Top, figure a) Spectroscopic redshift distribution for 
the combined SDSS DR6 spectroscopic training set. {Bottom, fig- 
ure b) Spectroscopic redshift distributions for the mock SDSS 
DR6 training and photometric sets. 



proximated as Gaussians that are uncorrelated between the 
diflFerent SDSS filters. 

The resulting magnitude and color distributions for the 
mock photometric sample are shown in the lower right panel 
of Fig. lb and the bottom panels of Fig. 2b. The redshift 
distribution for the sample is shown as the dark grey region 
in Fig. 3b. The r-magnitude distribution of the mock pho- 
tometric sample peaks at slightly brighter magnitude than 
for the actual DR6 photometric sample, and the r — i dis- 
tribution is slightly less peaked than that of the real data, 
but overall the real and mock distributions are quite similar 
in their photometric properties. As noted above, the goal of 



the mocks is not to exactly reproduce the real data distri- 
butions. 



3.2.2 Mock Spectroscopic Training Set 

We construct the mock spectroscopic training set by piecing 
together a variety of different catalogs with different selec- 
tion functions, each meant to qualitatively represent one of 
the spectroscopic training samples described above in §3.1.2. 
We obtain each component catalog of the training set by 
generating an independent realization of the mock photo- 
metric sample and applying the selection cuts of the spec- 
troscopic catalog to the realization. The selection cuts we use 
for each component spectroscopic catalog are given in Ta- 
ble 1. As discussed in Appendix A2, many of the real training 
set galaxies are located in the southern celestial stripe, which 
was imaged repeatedly by the SDSS. In the real training set, 
multiple photometric measurements of the same galaxy were 
treated as independent. We have simulated this effect in the 
mock training set by regenerating the magnitudes of each 
galaxy in the mock training sets as needed. The number of 
unique mock galaxies and total number of galaxies (count- 
ing all realizations of the same galaxy as different objects) 
are shown in the second and third columns of Table 1. For 
comparison, we have also generated spectroscopic catalogs 
with the same total number of objects but using only unique 
objects. We found no discernible differences in the resulting 
photo-z's or weights. 

The r-magnitude, color {g ~ r and r ~ i), and spectro- 
scopic redshift distributions of the spectroscopic samples for 
the mock SDSS DR6 data are shown in Figs. 1, 2, and 3. 
As is evident from comparison of the a and 6 components 
of Figs. 1 and 2, there are some noteworthy differences be- 
tween the selection cuts used for the mock training set and 
the actual target selection cuts applied in constructing the 
spectroscopic surveys described in §3.1.2 and Appendix A2. 
For example, for the SDSS spectroscopic catalog, the mock 
sample is flux limited at r = 18, while the actual spectro- 
scopic catalog comprises the MAIN sample, with a flux limit 
of r = 17.7, and the LRG sample, with red colors and a 
flux distribution that peaks around r ~ 19. For the other 
spectroscopic surveys, the actual photometric selection cuts 
were typically made in non-SDSS passbands, while our mock 
data and selection cuts were generated using the SDSS ugriz 
bands. Therefore, the mock photometric cuts do not exactly 
match the actual cuts used. As a result of this mismatch, 
e.g., the peak of the r-magnitude distribution of the mock 
2SLAQ sample is about one magnitude fainter than the cor- 
responding peak in the real data, as shown in the upper left 
panels of Figs, la and b. 



4 APPLICATIONS OF THE WEIGHTING 

METHOD I: IMPROVING PHOTOMETRIC 
REDSHIFT MEASURES 

With the mock and real galaxy catalogs in hand, we can 
now test the performance of the weighting method in differ- 
ent applications. In this section, we describe the utility of 
the weighting method in improving the performance of tra- 
ditional photo-2 estimates. In the next section, we use the 
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Table 1. Mock Spectroscopic Training Set Properties: Number of galaxies and photometric selection cuts applied. 



Catalog Unique objects All Objects Selection Cuts 

mockSDSS 531,672 531,672 r s: 18.0 

mockDEEP+DEEP2 2,419 31,716 g-r < 2.35(r - i) - 0.45, 

g - r < 1.95, 1.1 < r - i < 2, 

r < 22 

mockTKRS+CFRS+CNOC2 1,827 23,681 u < 23, g < 23, 

r < 22, i < 22 

mock2SLAQ 11,082 51,251 ((r - i) - (g - r))/8 > 0.55, 

0.7{g - r) + 1.2(r - i - 0.18) > 1.6, 

17.5 s£ i si 19.8, 

0.5 < g - r < 3, r - j < 2 



weighting method to directly estimate A'^(2) and compare 
the results with photo-z-based estimates. 

4.1 Estimating Photo-z Bias and Scatter 

We have applied an Artificial Neural Network (ANN) photo- 
2 estimator, described in Appendix C and in more detail in 
Oyaizu et al. (2008), to the SDSS DR6 mock catalog of §3.2. 
Despite the fancy name, an ANN is simply a function which 
relates redshifts to photometric observables. The training 
set is used to determine the best-fit value for the free pa- 
rameters of the ANN. The best-fit parameters are found 
by minimizing the overall scatter (see definition below) of 
the photo-z's determined for the training set galaxies. The 
ANN configurations are not unique in the sense that differ- 
ent sets of parameters can result in the same overall scatter. 
The best-fit parameters found after minimizing the scatter 
depend on where in parameter space the optimization run 
begins. Hereafter we refer to an ANN function using a given 
set of best-fit parameters as a neural network solution. The 
network is trained on the mock spectroscopic training set 
described in §3.2.2 and used to estimate redshifts for the 
mock photometric sample of §3.2.1. We have also trained 
and applied the network using the real DR6 data described 
in §3.1. 

The results of the ANN photo-2 estimator are displayed 
in Fig. 4, which shows the inferred redshift Zphot vs. true red- 
shift 2spec. Panel (b) shows the results for the mock spectro- 
scopic training set, while panel (c) shows the results for the 
mock photometric sample. For comparison, panel (a) shows 
results for the real SDSS DR6 training set data. As was seen 
in Fig. 3b, the redshift distribution of the mock photometric 
sample is considerably deeper than that of the mock training 
set. Not surprisingly, the photo-2 errors as a function of red- 
shift for the mock photometric sample are somewhat larger 
than one would estimate based on the training set (compare 
the 68 and 95% contours in panels (b) and (c)). This is a 
problem since, for real (as opposed to mock) galaxy cata- 
logs, one does not have the information necessary to make 
panel (c), i.e., one can only estimate photo-z performance 
using the training set. Since the training set is, as in this 
mock example, usually not representative of the photomet- 
ric sample, the statistics of photo-z quality for the training 
set are not accurate indicators of photo-2 quality for the 
photometric sample. 



To make this point more quantitative, we consider two 
standard statistical measures of photo-z quality, the scatter 
and bias as functions of spectroscopic redshift, 

Cr^{zj) = (l/A''-,)E^^|2phot,! - Zspoc,i|^, (8) 
K^j) = (l/^j)SSl(^phot,» - ZspocO, (9) 

where Nj is the number of objects in the jth Zgpec bin, i.e., 
with true redshifts in the interval Zj ± Az. Fig. 5 shows 
these measures for the mock training sample (left panels) 
and photometric sample (right panels) for five different neu- 
ral network solutions. These five solutions come from net- 
works with the same structure (same number of layers and 
nodes per layer, see Appendix C) but with different initial 
values for the network weights Wiafj . The left panels of Fig. 
5 show that the different solutions yield essentially identi- 
cal results for the scatter and bias for the training set, but 
the right panels show a dispersion of quality measures for 
the photometric sample. We can address this issue by work- 
ing with the average of the five photo-z solutions for each 
galaxy. The solid (black) curve in the top right panel of Fig. 
5 shows that the average photo-2 solution results in a h{z) 
that is the average of the biases of the individual neural net 
solutions, as may be expected. The bottom right panel of 
Fig. 5 shows a more interesting result, that the scatter of 
the average photo-2 solution is considerably smaller than 
the average scatter of the individual neural net solutions. 

Even if one uses the average photo-2 solution, compar- 
ison of the left and right panels of Fig. 5 demonstrates the 
qualitative point made above, that the scatter and bias vs. 
redshift for the training set are not accurate estimators of 
the scatter and bias over the full redshift range for the pho- 
tometric sample. As shown more explicitly in Fig. 6, the 
training set scatter and bias tend to underestimate those 
measures for the photometric sample, particularly at red- 
shifts Zspec < 0.3. This is simply because the training-set 
objects are generally brighter than those in the photometric 
set at similar redshift, which implies that the training-set 
galaxies have smaller photometric errors and consequently 
smaller photo-z errors. 

The weighting procedure provides a straightforward av- 
enue for addressing this problem of estimating the photo- 
z scatter and bieis for the photometric sample. Since the 
weighted training set has, by construction, magnitude dis- 
tributions similar to those of the photometric set, we can 
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Figure 4. z-p^^^ vs. ^spcc for {from left to right): (a) the real SDSS DR6 training set, (b) the mock SDSS training set, (e) the full mock 
photometric set, and (d) the recoverable mock photometric set, i.e. the part of the mock photometric set that is well represented in the 
training set. The dashed and dotted curves enclose 68% and 95% of the points in each Zspec bin. In the lower right of each panel, a is 
the rms photo-2 scatter averaged over all objects in the catalog, and crgg is the range containing 68% of the objects in the distribution 

of ^phot .2spec. 



instead use weighted versions of (j{z) and h{z) for the train- 
ing set as estimates of the scatter and bias for the photo- 
metric set, i.e., 

al,{zj) = (l/A''j)E^^Wi|2phot,i - 2spcc,i|^, (10) 
b-w{zj) = (l/A''j)E^i«;i(2;phot,i - Zspcc,i), (11) 

where the weights Wi are given by Eqn. (1) and the sums 
are over all objects in the training set. Fig. 6 shows the 
scatter and bias for the training set, the weighted training 
set, and the full photometric set, where the average photo- 
z of the five neural network solutions has been used. We 
see that the weighted training set yields estimates of scatter 
and bias that are much closer to those of the photometric set 
over the entire redshift range. Moreover, as noted in §2.4, we 
expect the weighting method to work best for the recoverable 
portion of the photometric sample. Fig. 6 also shows the 
scatter and bias vs. redshift for the recoverable photometric 
sample, showing that the weighted training-set estimates are 
very accurate in this case. 

Since the weights can be used to improve the estimates 
of photo-2 scatter and bias for the photometric set, one 
might hope that the weights could also be used to improve 
the photo-2 solution itself. However, because of the large 
number of degrees of freedom of the ANN, most of the infor- 
mation for the photo- 2 solution comes from small regions in 
the space of photometric observables around each training- 
set object. The weights do not vary strongly over those small 
regions, and therefore the photo-z solution does not change 
significantly between the unweighted and weighted cases. 



4.2 Estimating Photo-z Errors 

As demonstrated above, the weighting procedure improves 
the estimates of photo-2 scatter and bias for a photometric 
sample but does not improve the photo-2 accuracy itself. An- 
other issue, which we now discuss, is the accuracy of photo-2 
error estimates. 

We estimate photo-z errors for objects in the photomet- 
ric catalog using the Nearest Neighbor Error (NNE) estima- 
tor (Oyaizu et al. 2008). The NNE method is training-set 
based and associates photo-2 errors to photometric objects 
by considering the errors for objects with similar multi-band 



magnitudes in a spectroscopic sample, hereafter termed the 
"validation set". The validation set is chosen to be inde- 
pendent of the training set in order to avoid the issue of 
over-fitting, i.e., so that the ANN is not trained to fit the 
statistical fiuctuations of the training set, which would result 
in NNE underestimating the photo-2 errors. 

The NNE procedure to estimate the redshift error (tnne 
for a galaxy in the photometric sample is as follows. Using 
the distance measure of Eqn. (3), we find the validation- 
set nearest neighbors in magnitude space to the galaxy of 
interest. Since the selected nearest neighbors are in the 
spectroscopic sample, we know their photo-2 errors, 5z — 
Zphot — 2spoc, where 2phot has been estimated using the neu- 
ral network method. We calculate the 68% width of the Sz 
distribution for the neighbors and assign that number as the 
photo-2 error estimate for the photometric galaxy. Here we 
select the nearest 100 neighbors of each object to estimate 
its photo-2 error. In studies of photo-2 error estimators ap- 
plied to mock and real galaxy catalogs, we found that NNE 
accurately predicts the photo-z error when the training set 
is representative of the photometric sample (Oyaizu et al. 
2008). Here we investigate what happens when the training 
set is not representative, and we also consider the impact 
of weighting the neighbors using Eqn. (4) in computing the 
NNE estimate. 

Figure 7 shows the distributions of (2phot — •Zspec)/o"NNE, 
i.e., the photo-2 error distribution normalized by the NNE 
error estimate ctnne, for the training set (upper left panel), 
for the photometric set using unweighted (upper right) and 
weighted (lower left) validation-set objects, and for the re- 
coverable photometric set (lower right) using the weighted 
validation set. The dashed curves in these panels show Gaus- 
sian fits to the error distributions; we also indicate the 
best-fit Gaussian means (/xcauss) and standard deviations 
(cTGauss), as well as the ctbs widths (about zero) of the distri- 
butions (not of the fits). The Gaussian fits give equal weight 
to each bin of the distributions and ignore objects for which 
o"NNE ~ 0. We see that the overall normalized error distribu- 
tions are close to Gaussian for all the catalogs and that there 
is little difference among the four cases. We conclude that 
the NNE error estimate is robust even when the training 
set is not representative and that the weights do not signif- 
icantly affect the NNE estimator. In retrospect the latter is 



© 0000 RAS, MNRAS 000, 000-000 



10 Cunha et al. 



o - 



O 

I 



T — I — I — I — I — I — r 



-1 — I — I — I — r- 




Average 
Individual NN solutions 



_1 I I I I I I I I I I I I I I I L 



T — I — I — I — n 



O 

I 



I 



0.2 



0.4 



0.6 



0.8 




Average 
Individual NN solution^ 



_1 I I I I I I I I I I I I I I I L 



d 



n 
d 



Average 

Individual NN solutions 




d 



n 
d 



Average 

Individual NN solutions 




0.2 



0.4 



0.6 



0.8 



0.2 



0.4 



0.6 



0.8 



Figure 5. Upper panels: Photo-z bias b vs. Zspec for the 5 neural network photo-x solutions for the mock SDSS sample: {top left) training 
set (unweighted) and {top right) photometric set. Lower panels: Photo-^; scatter cr vs. Zspec for the 5 NN photo-^ solutions of the {bottom 
left) training set and {bottom right) photometric set. 



not too surprising since the NNE estimate is derived from 
a typically small nearest neighbor region, over which the 
weights do not vary strongly. 

4.3 Deconvolving the Photo-z Distribution 

The photometric redshift distribution is the convolution of 
the true redshift distribution Af(zspcc) with the distribution 
of photometric redshift errors. For discrete distributions we 
can express this as 



N {Zp\iot)i — ^ ^ P {Zp\iot\Zspcc) ij N {Zspac) j 



(12) 



where the indices i and j refer to bins of Zphot and Zgpcc, 
respectively, and P(2phot|2spec)ii is the probability that a 
galaxy has photo-z in bin i given that its spectroscopic red- 
shift is in bin j. 



As noted in Padmanabhan et al. (2005), we can solve 
Eqn. (12) for N{zspcg) by inverting P{zphot\zspcc)ii ■ How- 
ever, the inversion problem is ill-conditioned for two reasons. 
First, the convolution is a smoothing operation, and some of 
the information in A''(zspcc)j is irretrievably lost in that pro- 
cess. Second, small errors in P(2phot|2spcc)ij are magnified 
by the matrix inversion. 
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Both problems can be alleviated by using prior infor- 
mation to regularize the inversion and restore some of the 
lost information. Following Padmanabhan et al. (2005), we 
use a forward difference operator, defined as 



5' = 



E 

3=0 



\n{z 



J + 1 



(13) 



as a prior on the smoothness of the reconstruction. To incor- 
porate the prior information into the deconvolution proce- 
dure, we must represent the deconvolution as a minimization 
problem. If we define 



^ ^ IP (^^phot l2spec)ij 



A*'(2spec) 



A^(2pho 



,(14) 



then the deconvolution can be stated as the problem of min- 
imizing Eq with respect to N{z). To incorporate the prior, 
we define 



E = Eo + \S 



(15) 



and the regularized deconvolution is achieved by minimizing 
E. The parameter A sets how much importance is given to 
the smoothing and is often chosen ad hoc. Here, following 
Press et al. (1992), we set 



Tr^P^(aphot|%pec) ■ -P(ZpliotlZspec)^ 



Tri BT ■ B 



(16) 



where B is the {Nun — 1) x (Nhin) first difference matrix 
given by _B = —Sij. This choice of A gives comparable 

weight to both parts of the minimization. 

The preceding discussion summarizes the "standard" 
photo-z deconvolution method for estimating the redshift 
distribution. The weighting method can provide a better 



estimate of P{zphot\zapec)ij for the photometric sample, re- 
ducing the need for regularization and thereby improving 
the deconvolution estimate of N{zspac). We can incorporate 
the weights into the estimation of P(zphot|zspec)ij by cal- 
culating, for each Zspec bin, the Zphot distribution for the 
weighted training-set galaxies. 

We postpone discussion of the performance of the de- 
convolution and weighted deconvolution methods to the 
next section, where we compare them with direct application 
of the weighting method to estimation of N{zspcc). 



5 APPLICATIONS OF THE WEIGHTING 
METHOD II: ESTIMATES OF N{Z) AND 
P{Z) IN MOCK PHOTOMETRIC SAMPLES 

5.1 The redshift distribution N(z) 

We now have at hand a number of methods for estimat- 
ing the true redshift distribution A'^(2) for a photometric 
galaxy sample. Using photo-z's, one can simply use the 
photo-z distribution itself, A'^(2;phot), as an estimator, or 
the deconvolved photo-z distribution described in §4.3, or 
the weighted, deconvolved photo-z distribution mentioned 
at the end of §4.3. Alternatively, one can use the weighted 
spectroscopic redshift distribution of the training-set galax- 
ies to directly estimate N{z), i.e., Eqn. (5), without recourse 
to photo-z's. Finally, we can sum the redshift probability 
distributions p{z) for each galaxy in the photometric sample 
(again estimated from the weighted training set) to estimate 
N{z), using Eqn. (6). In this section, we compare results of 
these different estimates of N{z) using the mock SDSS DR6 
sample. The results are summarized in Tables 2 and 3 and 
the best results for each method are shown in Figs. 8, 9, and 
10. 
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Figure 7. Distributions of (zphot ~ ^spcc) / cr m N E for the {top left) training set, {top right) photometric set (using unweighted validation 
set), {bottom left) photometric set (using weighted validation set), and {bottom right) recoverable photometric set (using weighted 
validation set). 



5.1.1 Measures of Reconstruction Quality 

To compare the different methods, we need a statistical mea- 
sure of the quality of the reconstruction of the estimated 
redshift distribution. We use two. The first is a statistic 
(per degree of freedom and per galaxy), defined here as 



iVbin 



iVbin 



(17) 



the weighting procedure is used or to N{Zpi^a^)^ if the red- 
shift distribution is instead estimated using photo-z's. The 
usual definition of uses the numbers of objects in given 
bins instead of the normalized probability N{z^)\ multiply- 
ing our x? by A'^p^totAz gives the usual definition. We chose 
the above statistic so that it is independent of the number 
of galaxies and the number of redshift bins, allowing us to 
more fairly compare reconstruction quality across different 
data sets. Since the probabilities are normalized, the number 
of degrees of freedom is A'^bin — 1. 



Here Afbin is the number of redshift bins used, Az is the 
width of the bins, and N{z^)^ is equal to A''(2spec)wei if 



The second measure we employ is a binned version of 
the Kolmogorov-Smirnov (KS) statistic, defined as the max- 
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Figure 8. True spectroscopic redshift distribution {solid grey) of 
the mock SDSS photometric sample, and estimates of the redshift 
distribution using the photo-2 distribution {hatched) and decon- 
volved photo-2 distribution {black line). 

imum difference between the two cumulative redshift distri- 
butions being compared, for example, the cumulative dis- 
tributions corresponding to A'^(2Bpoc)wci N{zlpc^Y' . The 
KS statistic is more sensitive to differences in the medians 
of the two distributions being compared, whereas the 
statistic tends to stress the regions of the distribution that 
are least well sampled, i.e., regions where N{z') is small. 
In our implementation, we use binned cumulative distribu- 
tions instead of unbinned cumulative distributions, so this 
statistic is not strictly the KS statistic. 

Note that we do not use the absolute values of these 
statistics as formal goodness-of-fit measures. Rather, we use 
their relative values for the different estimators to compare 
the quality of the different reconstructions — see Table 2. 

5.1.2 Photo-z Estimates of N{z) 

The photo-2 estimate A''(zphot) of the true redshift distribu- 
tion for the mock SDSS photometric sample is shown in Fig. 
8 (hatched histogram). We can see that A'^(2:phot) underes- 
timates the true distribution, A'^(zspoc) (grey histogram), at 
both low and high redshifts and overestimates it at inter- 
mediate redshifts, 0.4 < Zspcc < 0.8. In addition, the peak 
of 7V(2ph ot) is biased with respect to the peak of N(^Zspcc) • 
Comparing the two distributions, we find that = 0.107 
and KS = 0.0848. The photo-z and true redshift distribu- 
tions for the recoverable photometric sample are shown in 
Fig. 9 (hatched and grey histograms). Again, N{zphot) un- 
derestimates N{zBpec) at low and high redshifts and over- 
estimates it in between. The reconstruction statistics are 
similar to those for the full photometric sample, ~ 0.105 
and KS = 0.0674. This indicates that the faithfulness of 
ot) as an estimate of the true redshift distribution is 
not very sensitive to whether the training set is representa- 
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Figure 9. True spectroscopic redshift distribution {solid) of the 
recoverable mock photometric sample, and estimates of the red- 
shift distribution using the photo-2 distribution {hatched) and 
deconvolved photo-z distribution {black line). 

live of the photometric sample: the errors in the recovered 
redshift distribution are dominated by a systematic effect. 
The fact that Af(zphot) is more sharply peaked than N{zBpec) 
is a common feature of training-set-based photo-z estimates 
and results from the breakdown of the fundamental photo-z 
assumption that a single Zphot can represent a full redshift 
distribution ^. For the full photometric sample the peak in 
N{zphot) is not as pronounced as it is for the recoverable 
photometric sample, because the larger photo-z scatter in 
regions not covered by the training set smoothes out the 
peak. 

We have also tested the photo-z deconvolution method 
of §4.3 as an estimate of the redshift distribution. The stan- 
dard (unweighted) deconvolution was not successful at re- 
covering N{z), with = 0.577, KS = 0.124 for the fuU 
photometric sample and x^ ~ 0.499, KS = 0.140 for the 
recoverable photometric sample. The result for the weighted 
deconvolution method, where the weights have been esti- 
mated using the five nearest neighbors, is shown by the black 
line in Fig. 8; it is also not very effective for the full photo- 
metric sample, with x^ = 0.521 and KS = 0.989. Although 
the peak of the deconvolved redshift distribution is at the 
correct redshift, the distribution shows an oscillatory be- 
havior with redshift. However, as shown in Fig. 9 (black line 
vs. grey histogram), the weighted deconvolution performs 
much better for the recoverable photometric sample, with 
X^ = 0.0648 and KS = 0.0266. 

The deconvolution estimate of the redshift distribution 

^ Maximum-likelihood template-fitting photo-2 methods suffer 
from a similar problem but with opposite consequences. Because 
of the different way in which p(2|observables) is estimated in those 
cases, Af(2piiot) tends to be flatter than the true redshift distri- 
bution (Brodwin et al. 2006) 
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Table 2. Rcdshift Distribution Reconstruction Statistics - 30 bins 



Figure 10. True spectroscopic redshift distribution (solid grey) 
of tile recoverable mock photometric sample, and estimates of the 
redshift distribution using the weights (hatched) and p(z) (line) 
methods. 



Full Photometric Set 




KS parameter 


Photo-2 


0.107 


0.0848 


Photo-2 deconvolution (no weights) 


0.577 


0.124 


Photo-2 deconvolution (100 nb) 


0.521 


0.0989 


Weights (100 nb) 


0.0341 


0.0456 


Recoverable Photometric Set 


Photo-2 


0.105 


0.0674 


Photo-2 deconvolution (no weights) 


0.499 


0.140 


Photo-2 deconvolution (2nb) 


0.0682 


0.0295 


Photo-2 deconvolution (5nb) 


0.0648 


0.0266 


Photo-2 deconvolution (lOOnb) 


0.102 


0.0351 


Weights (2 nb) 


0.00624 


0.0129 


Weights (5 nb) 


0.00571 


0.0145 


Weights (100 nb) 


0.00643 


0.0246 


p(z) (2 nb) 


0.00540 


0.0219 


p(z) (5 nb) 


0.00493 


0.0201 


p(z) (100 nb) 


0.00534 


0.0241 



nb = neighbors 



Table 3. Rcdshift Distribution Reconstruction Statistics - 20 bins 



oscillates about the true distribution. This kind of behav- 
ior is typical of the inversion techniques used to perform 
the deconvolution. It can be alleviated by either increas- 
ing the training-set size, decreasing the number of redshift 
bins, or using prior knowledge to improve the estimate of 
f (^^photlzspoc)- We briefly investigate the second of these 
possibilities. As Table 3 shows, using only 20 as opposed 
to 30 redshift bins improves the deconvolution estimate, 
= 0.0509 and KS = 0.0235 (here with weights calcu- 
lated using the two nearest neighbors). However, fewer bins 
means coarser redshift information, so it would be preferable 
to find a method that can accommodate a large number of 
redshift bins. Table 3 also shows that the other methods are 
not as sensitive to the number of bins. The deconvolution 
can also be improved by Monte Carlo resampling the train- 
ing set (Padmanabhan et al. 2005). Ideally, the resampling 
should be done in the space of observables used to calcu- 
late the photo-a's. However, this approach is prohibitively 
time consuming for large datasets, and it requires accurate 
knowledge of the magnitude errors - which may be hard to 
obtain. 



5.1.3 Weighting Method Estimates of N{z) 

The direct estimate of the redshift distribution for the pho- 
tometric sample using the weighting method of Eqn. (5), 
N(z)^ci, is shown by the hatched region in Fig. 10. By con- 
struction, this estimate is the same for both the full and 
the recoverable photometric samples, that is, the weight- 
ing method in practice provides an estimate of the redshift 
distribution for the recoverable photometric sample. Com- 
parison with the true redshift distribution of the recover- 
able sample (solid grey histogram in Fig. 10) shows that the 
weighting method provides the best redshift distribution es- 



Recoverable Photometric Set 




KS parameter 


Photo-2 


0.105 


0.0674 


Photo-2 deconvolution (no weights) 


0.404 


0.125 


Photo-2 deconvolution (2 nb) 


0.0509 


0.0235 


Photo-2 deconvolution (5 nb) 


0.0566 


0.0232 


Photo-2 deconvolution (100 nb) 


0.0971 


0.0290 


Weights (2 nb) 


0.00484 


0.0129 


Weights (5 nb) 


0.00467 


0.01327 


Weights (100 nb) 


0.00547 


0.0232 



nb = neighbors 



timate of any of the methods under consideration here. For 
the full photometric sample, = 0.0341 and KS = 0.0456 
(using 100 nearest neighbors), and for the recoverable sam- 
ple, x^ = 0.00571 and KS = 0.0145 (with 5 nearest neigh- 
bors). As shown in Table 2, A'^(2)wci is relatively insensitive 
to the number of neighbors used in the calculation. 

Finally, using the sum of the p(z) estimates for each 
galaxy in the photometric sample is almost identical to using 
the weights to estimate N(z). The estimate N('Y^p{z)) of 
the redshift distribution is shown by the solid black line in 
Fig. 10, using 5 nearest neighbors to estimate p(z). For this 
case, from Table 2 we have x^ = 0.00493, KS = 0.0241 for 
the recoverable photometric set, quite close to the values for 
the A'^(2;)wci estimate. Table 2 also shows that using fewer 
nearest neighbors slightly improves the KS statistic but not 
the x^ statistic. Moreover, by using fewer neighbors, one is 
unable to accurately characterize p{z), so we caution against 
using fewer than 100 neighbors in the weighted estimate of 
p(z). 
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5.1.4 Error estimate for N(z^ci) 

From Eqns. 1 and 2, the errors in the weights depend upon 
the uncertainties in determining the volumes of the training- 
set and photometric-set regions around an object and upon 
the uncertainties in the number of nearest neighbors for both 
the training and photometric sets. All of these quantities 
are correlated, making error estimation for the weighting 
method a challenge. Instead, we apply a bootstrap resam- 
pling procedure to directly estimate the errors in the quan- 
tity of interest, in this case the weighted estimate of the 
redshift distribution, N{z)v;ei. We sample with replacement 
from the training and photometric sets to generate resam- 
pled training and photometric sets of the same sizes as the 
originals. Then, for each pair of resampled training and pho- 
tometric sets, we calculate the weights using Eqn. (4) and 
generate N{z)^ci using Eqn. (5). We repeat this procedure 
10, 000 times and estimate the covariance matrix by 

C{Za,Zp) = 3_ X 

Us — I 

Tig 

m^'^) - {N{z^)))m^^) - (iv(zfl))) , (18) 

i = l 

where Us is the number of bootstrap samples, Ni{z) is the 
weighted estimate of the redshift distribution in the ith 
bootstrap sample, and {N{z)} is the mean of the bootstrap 
estimates. The correlation matrix is defined in the usual way 
by p{Za, Zp) = C{Za, zp)/a{za)a{zi3). 

Fig. 11 shows N{z)viei (hatched), the mean of the boot- 
strap estimates (solid black), and error bars given by the 
square root of the diagonal elements of the covariance ma- 
trix. There are small anti-correlations between nearby red- 
shift bins, of at most -0.2. Correlations between non-adjacent 
bins are smaller by at least an order of magnitude. 

5.1.5 Correcting Systematic Errors in the N{z) Estimate 

From Fig. 10, we note that the N{z^ci) distribution is 
slightly flatter than N{zspec), a feature that also shows up 
in other catalogs (see, e.g., Lima et al. 2008). This smooth- 
ing of the redshift distribution is a consequence of using 
non- negligibly small regions in magnitude space around the 
training-set galaxies to estimate the weights. This is es- 
pecially problematic for regions where the training set is 
sparse, for then the "neighbor volume" used to calculate 
the weights may be large compared to the typical scale of 
change of the redshift/observable hypersurface. The prob- 
lem is compounded when photometry errors are large, be- 
cause large errors broaden the redshift distribution in a bin 
of observables. Broader distributions require a larger num- 
ber of training-set objects in order to be well characterized, 
but increasing the number of training-set nearest neighbors 
in the weights calculation increases the non-locality of the 
estimate. The ideal solution would be to increase the total 
number of training-set objects in the sample, or at least the 
number in sparsely covered regions, but that is not always 
an option. The poor man's alternative is to develop ways to 
characterize and correct for the systematic errors. 

An empirical approach we have developed makes use 
of the photometric redshifts in the following way. Starting 
with the training set, compute the photo-2 distribution of 
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Figure 11. True spectroscopic redshift distribution (solid grey) of 
the recoverable photometric set, the estimated redshift distribu- 
tion using the weighting method {hatched region), and the mean 
of the bootstrap samples for the weighting method (black line). 
The error bars are given by the square root of the diagonal terms 
of the covariance matrix calculated from the bootstrap samples. 



the weighted training set, N{zphot)wei, i.e., use Eqn. (5) but 
with z replaced by Zphot everywhere. The difference between 
A'^(zphot)wei and the photo-z distribution for the photometric 
sample, N{zphot), is shown by the dotted line in Fig. 12. The 
bias we are actually interested in is A''(2spec)wei — N{zapec), 
shown by the solid line in Fig. 12. We see that these two 
differences have similar behavior with redshift, presumably 
due to similar non-locality of the weight solution in re- 
gions where the training set is sparse. We can therefore use 
A'^(2phot)woi — N{zphot), the bias in the weighted photo-z 
distribution and which is an observable for the photometric 
sample, to estimate A'^(2;spcc)woi — A''(2;spec), the systematic 
error in the weighted estimate of the true redshift distribu- 
tion. The redshift distribution estimate can then be approx- 
imately corrected for this bias. 

To reduce the effect of random errors in the estima- 
tion of the bias, we smooth A''(2phot)woi — N{zpii ot j using a 
"moving window" method. Each redshift window has width 
greater than half of the separation between window cen- 
troids. The smoothing factor is the ratio of the window size 
to the redshift bin size when no smoothing is used. We 
have used smoothing factors of 1, 2, 3, and 5 to calculate 
A'^(-Zphot)woi — N{zpi^ot). A smoothing factor of 1 corresponds 
to a window size of 0.0367 in redshift. We picked the other 
smoothing factors based on the natural scales set by the a 
and (768 of the photo-2's in the training and photometric 
sets. 

Table 4 shows the recovery statistics for the distribu- 
tions corrected for systematics in this way, and Fig. 13 shows 
the improvement in the N(z) estimate when the correction 
with smoothing factor of 2 is applied. While these results 
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Figure 12. Bias in tlic wcighting-metliod estimate of the redshift 
distribution for the recoverable photometric set. Solid line shows 
the bias in the true redshift distribution. Dotted line shows the 
bias in the weighted photo-z distribution, also for the recoverable 
photometric set. Since they approximately match, we can use the 
bias in the weighted photo-z distribution, which is an observable, 
to estimate the bias in the weighted true redshift distribution. 



Table 4. Redshift Distribution Reconstruction Statistics - Cor- 
rection of Systomatics 



Recoverable Photometric Set - 


5 neighbors 


- < 2 < 1.1 


Smoothing factor 




KS parameter 


No correction 


0.00571 


0.0145 


Unsmoothed 


0.00487 


0.0151 


2 


0.00351 


0.0127 


3 


0.00349 


0.0134 


5 


0.00355 


0.0131 


Bootstrap mean (no correction) 


0.00600 


0.0189 



are suggestive, more testing should be done before adopting 
this method as a correction for systematic errors in practice. 



5.2 The Probability Distribution p(z) 

In this section we examine the effectiveness of the weighted 
training set in estimating the redshift probability distribu- 
tion p{z) for individual galaxies, and the relation between 
p{z), Zphot, and Zspoc- For this study, we have increased 
the size of the mock photometric set to 9, 000, 000 galaxies 
in order to improve the statistics. As before, we calculate 
the training-set estimate of p[z), hereafter p(ztrain), for a 
training-set galaxy by selecting its 100 nearest neighbors in 
the training set. The spectroscopic redshift distribution of 
these objects is p(2train). We then select all the galaxies in 
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Figure 13. True spectroscopic redshift distribution (solid grey) 
of the recoverable photometric set, and the estimated redshift 
distribution using the weighting method, showing both the un- 
corrected results {hatched) and results corrected for systematic 
errors (black line) as described in the text. 



the photometric sample that are closer to the given galaxy in 
magnitude space than its 100"'-nearest training-set neigh- 
bor. The spectroscopic redshift distribution of the selected 
photometric galaxies is, barring statistical fluctuations and 
non-locality, the true redshift distribution, hereafter p{ztruc), 
of the region of observable space centered about the selected 
galaxy. 

In Figure 14 we show the redshift distributions for three 
galaxies. In each panel, p{ztiue) is shown as a grey histogram 
with 60 bins, and p(2train) is shown as the hatched histogram 
with 20 bins. We have rescaled the histograms by multiply- 
ing each by the width of the histogram bin for easier com- 
parison of the distributions. The solid vertical line indicates 
the true redshift of the galaxy and the dashed vertical line 
indicates its ANN 2phot estimate. The left panel of the figure 
is for an early-type (T = 1.5) galaxy with r-mag of 20.67 and 
Zspoc ~ 0.48. This galaxy has 4,006 neighbors in the photo- 
metric sample, i.e., that many photometric objects are as 
close to it in magnitude space as its 100 nearest training-set 
neighbors. In this example, the true redshift distribution of 
this region of observable space is narrow, p(2train) is a quite 
accurate estimate of p{ztiue), ^phot is very near Zspcc, and 
both are at the peak of the p{z) distributions. 

The middle panel shows the distributions for a late- 
type (T = 3.1) galaxy with r = 21.1 and Zspcc = 0.56. 
There were 14,606 neighbors to this galaxy in the photo- 
metric sample. With the exception of the extreme tails of 
the distribution, p(ztrain) provides an accurate estimate of 
p(ztruc). The redshift PDF p(ztruc) for this galaxy is much 
broader than that for the galaxy in the left panel, in part 
because the magnitudes of late-type galaxies do not corre- 
late with redshift as well as those of early types. The neural 
network photo-z is 0.39 for this object, higher than the peak 
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of p(2train) at 2 = 0.3 Of its median a.t z = 0.34. The true 
redshift of this object, Zspcc ~ 0.56, is far removed from the 
peak of its redshift distribution. However, the photo-z error, 
2phot — Zspec = 0.16, is comparable to the photo-z scatter at 
this redshift, a{zspcc ~ 0.56) ~ 0.13 (see bottom right plot 
of Fig. 5), which shows that this example is not atypical. 
The broader p(ztrue) is, the more likely it is that Zapec will 
be far from the peak of the distribution. In that case, the 
photo-zestimator cannot zero in on the correct redshift, and 
a single-point Zphot estimate will be a poor redshift estimate 
for a large fraction of the objects in this region of observable 
space. 

The right panel of Fig. 14 shows the distributions for 
another early-type {T = 1.4) galaxy with r = 21.8 and 
Zspec = 0.31, with 18,366 neighbors in the photometric set. 
This is the most pathological of the three examples. The 
large width of p(ztruc) for this galaxy is due to its faint- 
ness, which results in large magnitude errors. The peaks of 
^(ztrain) and p{ztTue) slib offset by ~ 0.1 - 0.2, and p(ztrain) 
shows a spurious second peak at a ~ 1. Such fluctuations are 
not uncommon when one uses 100 galaxies to estimate p{z). 
The true redshift of this galaxy is at the low-redshift tail of 
p(ztrue), and Zphot for this object is catastrophically wrong 
even though it is near the peak of ^(ztrue). The catastrophic 
error results from using a single number to represent a very 
broad distribution, and in this case the galaxy in question 
is quite different from most of its neighbors in magnitude 
space. For a photometric survey, the redshift distribution is 
typically broad near the photometric limit of the survey. To 
avoid catastrophic errors and biases, one should work with 
the full redshift probability distribution per object. 



6 APPLICATION TO SDSS DR6 DATA 

Now that we have tested the weighting method on mock 
SDSS photometric samples, we apply it to the actual SDSS 
DR6 photometric sample. 



6.1 Bias and Scatter in SDSS Photo-z's 

Oyaizu et al. (2008) estimated photo-z's for the SDSS DR6 
photometric sample using an artificial neural network (see 
Appendix C) and several different combinations of photo- 
metric observables. One version, denoted there by Dl, used 
as input observables the five magnitudes ugriz and five con- 
centration indices, also splitting the training set and the 
photometric sample into 5 bins of r magnitude and per- 
forming separate ANN fits in each bin. Version CC2 used as 
inputs the four colors u — g, g — r, r — i, i — z, plus the con- 
centration indices in g, r, and i. Here, as in §4.1, we use the 
weighting method to obtain improved estimates of the bias 
and scatter of these photo-z estimates. Figure 15 shows the 
weighted and unweighted h{z) and rj{z) estimates derived 
from the training set, along with third-order polynomial fits 
to the weighted estimates. The polynomial fit coefficients are 
given in Table 5. The differences between the weighted and 
unweighted b{z) and (j[z) curves are qualitatively consistent 
with the results on the mock sample (Fig. 6), but the real 
data have larger scatter and bias than the mocks. 



6.2 The SDSS Redshift Distribution 

Fig. 16 shows the weighting method estimate, A'^(z)wGi, for 
the redshift distribution of the SDSS DR6 photometric sam- 
ple with r < 22. The error bars on 7V(2:)wci are given by the 
square root of the diagonal elements of the covariance matrix 
obtained by the bootstrap resampling procedure described 
in 5.1.4. 

The coarse-grained structure of the redshift distribution 
is similar to that of the mock SDSS sample (Fig. 3). How- 
ever, the fine-grained structure shows peaks and dips that 
the study of §5.1.5 suggests are indications of systematic er- 
ror. As noted there, large photometric errors, combined with 
sparseness of the training set, can lead to distortions of the 
inferred redshift distribution. This effect is likely present in 
the weighted estimate of the SDSS DR6 redshift distribu- 
tion for galaxies with r < 22. The bump in N{z)v,Bi around 
z = 0.75 is the result of the magnification of the sampling 
errors in the training set caused by the lack of redshift infor- 
mation in the photometry of faint galaxies, combined with 
the lack of training-set coverage in that redshift range. 

When we impose more stringent r-magnitude cuts. Fig. 
17 (left) shows that the feature disappears. In Fig. 17 (right) 
we show the N{z) distribution estimated using p{z) us- 
ing two different training sets. In one case we use the full 
training set to estimate the p(2)'s while in the other we 
remove all galaxies from DEEP/DEEP2 and 2SLAQ (to- 
talling 84,568 galaxies) from the training set and we add 
6,069 from two approximately flux-limited samples, DEEP2- 
EGS (Davis et al. 2007) and zCOSMOS (Lilly et al. 2007), 
which we describe in further detail in §6.3. The bump at 
z = 0.75 disappears when DEEP/DEEP2 is not included, 
showing that the selection in DEEP/DEEP2, which was 
done to target z ~ 0.7 galaxies and in a different photo- 
metric system from SDSS is responsible for the bump. The 
effects of 2SLAQ are much less pronounced, and consist in a 
small overall shift of the distribution. 2SLAQ has morphol- 
ogy cuts (in addition to the SDSS ugriz magnitude cuts) 
which could have yielded some systematic biases. As men- 
tioned previously, the selection effects are amplified by the 
photometry errors, so that the systematics are reduced if 
one imposes more stringent magnitude cuts. If one is primar- 
ily interested in the overall redshift distribution, the N{z) 
estimate using the training set without DEEP/DEEP2 or 
2SLAQ is more reliable. However, if one requires redshift in- 
formation for individual galaxies, the estimate using the full 
training set is still preferable. Without DEEP/DEEP2 and 
2SLAQ, the training set is too sparse at faint magnitudes. 
As a result, the individual p{z) estimates are derived using 
training set objects spread out over a large region of ob- 
servable space, which makes the p{zYs poor representations 
of the local redshift distributions around the corresponding 
galaxies. Given the training sets available, the best way to 
reduce the effects of selection issues while having reliable 
p{z) estimates is to perform magnitude cuts. 

We see another feature in A'^(2:)wci in the range 0.2 < 
2 < 0.4 that does not go away with tighter r-mag cuts 
(see Fig. 17). Similar features can be seen in the zCOS- 
M0S-I-DEEP2/EGS redshift distribution used by Mandel- 
baum et al. (2007) (see the bottom right panel of Fig. 4 of 
Mandelbaum et al. 2007), in the CNOC2 distribution used 
in our training set (see Fig. 2 in Lima et al. 2008), and 
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Figure 14. Distributions of p(2:truo) {solid grey histograms) and p(2train) {hatched histograms) for three training-set galaxies in tlie mock 
SDSS sample. The vertical solid (dashed) lines indicate 2spcc(zpijot) for each galaxy. Left: an early-typo galaxy at z = 0.48; middle: a 
late-type galaxy at z = 0.56; right: a faint, early-type galaxy at z = 0.31. 
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Figure 15. Left panels: Estimated photo-z bias vs. redshift for the weighted and unweighted training set of the SDSS DR6 catalog for 
four cases: {top left) Dl photo-z's with r < 21, {top right) Dl photo-z's with r < 22, {bottom left) CC2 photo-z's with r < 21, and {bottom 
right) CC2 photo-z's with r < 22. Right panels: Estimated photo-z scatter vs. redshift for the weighted and unweighted training set of 
the SDSS DR6 catalog for the same cases depicted in the left panels. In each plot the dashed line corresponds to the unweighted result, 
the solid dark line to the weighted result, and the solid rod line is a 3rd order polynomial fit to the weighted result. The fit coefficients 
are given in Table 5. 



in the full CNOC2 sample shown in Lin et al. (1999). The 
feature in the DEEP2 data appears to be caused, at least 
partially, by spectroscopic failures affecting both early- and 
late-type galaxies in that redshift range (J. Newman, private 
communication), and it is possible that this is affecting the 
weighted estimate. In general, one should not expect that 
the redshift distribution of a sample flux-limited in one filter 
will be smooth, due to k-correction-like effects. The complex 
shape of the spectral energy distributions of galaxies implies 
that a flux limit based on a single filter will preferentially 
select certain galaxy types at certain redshifts. We do not 
see such a feature in the mock SDSS catalog, because the 



mock was created with a smooth r-magnitude distribution 
and redshift distribution, and we only applied a cut in the 
r-band. 

6.3 A p(z) Catalog for SDSS DR7 

We calculated p(z)'s for the full SDSS DR7 sample satisfying 
the selection cuts of the Photoz2 photometric redshift table 
described in Appendix Al and in Oyaizu et al. (2008) - a to- 
tal of 78,135,961 galaxies. We added a sample of 4,241 galax- 
ies with spectra from zCOSMOS with quality flags 2.5, 3.4, 
3.5, 4.4, 4.5, 9.3, 9.4, 9.5 (Lilly et al. 2007) and 1,828 galax- 
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Table 5. Fit coefficients to the weighted estimates of photo-^ bias and scatter vs. redshift for SDSS DR6 catalog 





Dl Photo-2' 

r < 21 


s 

r < 22 


b{z) 
a{z) 


[0.0900269,-0.293255,0.262842,-0.523857] 
[0.167949,-0.82395,1.69819,-0.484006] 


[0.16574,-0.35082,0.192806,-0.355683] 
[0.273305,-0.788055,0.951591,-0.0426683] 




CC2 Photo-z 


's 


b(z) 
a{z) 


[0.0884344, -0.0574277, -0.607687, 0.279678] 
[0.217213, -0.77692, 1.36055, -0.406967] 


[0.193711, -0.527042, 0.421479, -0.408717] 
[0.329747, -1.0009, 1.31667, -0.262655] 



All fits are 3rd order polynomials of the form ai + a2Z + a^z + a^z'^ 
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Figure 16. Estimated redshift distribution for the SDSS DR6 sample (with r < 22), computed using the weighting method {hatched] 
and the mean of the bootstrap samples {solid line). The error bars are the diagonal bootstrap errors. 



ies from DEEP2-EGS (Davis et al. 2007) with Zquaiity > 3 
to the training set. We do not use the ubercalibrated mag- 
nitudes (Padmanabhan et al. 2008) available for DR7 be- 
cause these were not available for most of our training set 
galaxies. The catalog is available from the SDSS DR7 value- 
added catalogs website^. There are 240 files, ordered by RA, 
one for every 0.1 hour of RA from — 23h Thus, the file 
named pofz.ral2h3.dat has photo-z's and p{zYs for objects 
with 12.3/1 RA < 12.4/i, and so forth. The p(z) values 
are tabulated for 100 redshift bins, centered at z = 0.03 to 
1.47, with redshift spacing dz = 1.44/99. To reduce effects of 
Poisson noise we adopt a 'moving window' smoothing tech- 
nique. Each entry for a given p(z) is calculated based on 
a bin of width 4 * dz. As discussed in §6.2, the quality of 
the estimates degrades rapidly for r > 21.5. We therefore 
recommend a cut in brightness of at least r < 21.8. 



^ http : //www. sdss . org/dr7/products/value_added/index.html 
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7 DISCUSSION AND FUTURE WORK 

We have extended and applied the weighting technique of 
estimating redshift distributions (Lima et al. 2008). The 
weighting procedure allows one to use a spectroscopic train- 
ing set to accurately estimate the bias and scatter of photo- 
z's as a function of redshift. In addition, the weighting 
method provides a natural, robust way to select galaxies 
in the photometric sample that are well represented in the 
training set. Moreover, we have shown that the weighting 
technique provides a precise estimate of the redshift distri- 
bution of a photometric sample in the region of observable 
space where the training set and the photometric sample in- 
tersect. The estimate A'^(z)wei more accurately estimates the 
redshift distribution for a photometric sample than meth- 
ods based on photo-z's. We have also extended the weight- 
ing method to estimate the redshift probability distribution 
function for individual galaxies, p{z). Use of this PDF can 
substantially reduce biases associated with the use of single- 
point photo-z's, and we recommend its use in the analysis 
of future photometric galaxy surveys. 
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Figure 17. (Le/t) Weighted estimates of the redshift distribution for the SDSS DR6 photometric sample, with r < 21.5, r < 21.8, and 
r < 22. {Right) p(z) estimates of redshift distribution for SDSS DR7 photometric sample using the full training set as well as a training 
set without the DEEP2 (the non-EGS part) and without 2SLAQ. 



We have outlined the potential different sources of error 
of the weights technique and we have demonstrated how to 
use information from the photo-z distribution to reduce sys- 
tematic errors in the weights. We have shown that for the 
SDSS DR7, selection effects in the training set are the dom- 
inant source of error in the estimation of N(z), and that this 
systematic increases sharply with r-magnitude. In particu- 
lar, we have found that the selection of the DEEP2 survey, 
which uses a different set of filters from the SDSS, is the 
dominant source of systematic errors. 

We have made public a catalog of p(z) for ~ 78 million 
SDSS DR7 galaxies. We have also provided fitting-functions 
for the weights-based estimates of the bias and scatter of 
photo-z's as a function of redshift for the Dl and CC2 photo- 
n's of the SDSS DR6. 

For the future, investigations of the weighting method 
should include study of optimizing the weights estimation, 
e.g., with a variable number of nearest neighbors in differ- 
ent regions of observable space, and inclusion of systematic 
effects, e.g., associated with large-scale structure and spec- 
troscopic failures, in the mock catalogs. 
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APPENDIX A: SDSS DR6 DATA SAMPLE 
Al Photometric set 

The Sloan Digital Sky Survey (SDSS) comprises a large- 
area imaging survey of the north Galactic cap, a multi-epoch 
imaging survey of an equatorial stripe in the south Galac- 
tic cap, and a spectroscopic survey of roughly 10® galax- 
ies and 10^ quasars (York et al. 2000). The survey used a 
dedicated, wide-field, 2.5m telescope (Gunn et al. 2006) at 
Apache Point Observatory, New Mexico. Imaging was car- 
ried out in drift-scan mode using a 142 mega-pixel camera 



© 0000 RAS, MNRAS 000, 000-000 



Estimating the Redshift Distribution of Photometric Galaxy Samples II. Applications and Tests of a New Method 



(Gunn et al. 2006) that gathers data in five broad bands, 
ugriz, spanning the range from 3,000 to 10,000 A (Fukugita 
et aL 1996), with an effective exposure time of 54.1 seconds 
per band. The images were processed using specialized soft- 
ware (Lupton et al. 2001; Stoughton et al. 2002) and were 
astrometrically (Pier et al. 2003) and photometrically (Hogg 
et al. 2001; Tucker et al. 2006) calibrated using observations 
of a set of primary standard stars (Smith et al. 2002) ob- 
served on a neighboring 20-inch telescope. 

The imaging in the sixth SDSS Data Release (Adelman- 
McCarthy et al. 2007, hereafter DR6) covers a nearly con- 
tiguous region of the north Galactic cap. In any region where 
imaging runs overlap, one run was declared primary^ and 
was used for spectroscopic target selection; other runs were 
declared secondary. The area covered by the DR6 primary 
imaging survey, including the southern stripes, is 8520 deg^, 
but DR6 includes both the primary and secondary observa- 
tions of each area and source (Adelman-McCarthy et al. 
2007). 

In this paper, we use a random 1% subset of the SDSS 
DR6 Photoz2 catalog described in Oyaizu et al. (2008) as 
our photometric sample. The Photoz2 catalog contains all 
primary objects from DR6 (drawn from the SDSS CasJobs 
website*) that have the TYPE flag equal to 3 (the type for 
galax;y) and that do not have any of the flags BRIGHT, 
SATURATED, SATUR.CENTER, or NOPETRO^IG set. 
For the deflnitions of these flags we refer the reader to the 
PHOTO flags entry at the SDSS website^ The fuU Photoz2 
photometric sample comprises 77, 418, 767 galaxies. The r 
magnitude, g — r, and r — i color distributions are shown in 
the bottom panels of Figs, la and 2a. 

A2 Spectroscopic training samples 

As noted in the text, the spectroscopic training sample 
we use for SDSS DR6 is drawn from a number of spec- 
troscopic galaxy catalogs that overlap with SDSS imaging. 
Each survey providing spectroscopic redshifts deflnes a red- 
shift quality indicator; we refer the reader to the respective 
publications listed below for their precise definitions. For 
each survey, we chose a redshift quality cut roughly corre- 
sponding to 90% redshift confidence or greater. The SDSS 
spectroscopic sample provides 531, 672 redshifts, principally 
from the MAIN and Luminous Red Galaxy (LRG) sam- 
ples, with confidence level Zconi > 0.9. The remaining red- 
shifts are: 21, 123 from the Canadian Network for Observa- 
tional Cosmology (CNOC) Field Galaxy Survey (CN0C2; 
Yee et al. 2000), 1,830 from the Canada-France Redshift 
Survey (CFRS; Lilly et al. 1995) with Class > 1, 31, 716 from 
the Deep Extragalactic Evolutionary Probe (DEEP; Davis 
et al. 2001) with = A or B and from DEEP2 (Weiner et al. 
2005)'^ with Equality ^ 3, 728 from the Team Keck Redshift 
Survey (TKRS; Wirth et al. 2004) with Equality > -1, and 
52, 842 LRGs from the 2dF-SDSS LRG and QSO Survey 
(2SLAQ; Cannon et al. 2006)'^ with Zop ^ 3. 



For the precise definition of primary objects see 
http: //cas . sdss . org/dr6/ en/help/docs/glossary . asp#P 
^ http: //cas jobs . sdss . org/casjobs/ 

* http : //cas . sdss . org/drS/en/help/browser /browser . asp 

6 http://deep.berkeley.edu/DR2/ 

^ http: / /Irg. physics .uq. edu. au/New_dataset 2/ 



We positionally matched the galaxies with spectroscopic 
redshifts against photometric data in the SDSS BestRuns 
CAS database, which allowed us to match with photometric 
measurements in different SDSS imaging runs. The above 
numbers for galaxies with redshifts count independent pho- 
tometric measurements of the same objects due to multi- 
ple SDSS imaging of the same region; in particular SDSS 
Stripe 82 has been imaged a number of times. The numbers 
of unique galaxies used from these surveys are 1,435 from 
CNOC2, 272 from CFRS, 6, 049 from DEEP and DEEP2, 
389 from TKRS, and 11,426 from 2SLAQ. The SDSS spec- 
troscopic samples were drawn from the SDSS primary galaxy 
sample and therefore are all unique. 



APPENDIX B: SDSS DR6 MOCK CATALOG 

Using spectral template libraries and observational data on 
the redshift-dependent luminosity functions of galaxies of 
different types, we have constructed mock photometric and 
spectroscopic samples that reproduce the main features of 
the real SDSS DR6 samples. In particular, we fit simple 
polynomial functions to the Schechter parameters of Zucca 
et al. (2006) to derive a continuous relationship between 
the Schechter parameters M*,a,(j)*, redshift z, and galaxy 
type T, using the centroid of each redshift bin for the fit. 
To regularize the fits, we visually extrapolate the results of 
Zucca et al. (2006) to the z — (0, 0.2) bin and, where needed, 
for the (1.2, 1.5) bin. 

The Schecter luminosity function is defined as 

a + l 



(j){M)dM = -<^*(lnlO) 
o 



X exp 



105 



(]\/* -A/) 



(Bl) 



where <j){M)dM is the number of galaxies with absolute mag- 
nitudes between M and M + dM. 

The Schechter parameters we use are shown in Ta- 
ble Bl. The polynomials we derive are: 



(B2) 
(B3) 

(B4) 



a = biT^ + h2Tz + hzz + hiz^ + 65 

M* = ClT^ + C2Tz + CSZ + C4,Z^ + C5 

4)* = diT'^ + d2Tz + dzz -\- d^z^ + dg 
+d(iT'^z + drT'^ 

We find the best-fit coefficients to be: 

b = [-0.087,0.050,0.998,-1.143,-0.383], 
c = [0.068,-0.202,-0.806,0.227,-19.86], 
d = [2.04,-5.20,-0.636,0.910,4.181,1.417,-0.536]. 



APPENDIX C: ARTIFICIAL NEURAL 
NETWORK PHOTO-Z'S 

For comparison with the weighting method, we use an Artifi- 
cial Neural Network (ANN) method to estimate photometric 
redshifts (Colfister & Lahav 2004; Oyaizu et al. 2008) We 
use a particular type of ANN called a Feed Forward Mul- 
tilayer Perceptron (FFMP), which consists of several nodes 
arranged in layers through which signals propagate sequen- 
tially. The first layer, called the input layer, receives the in- 
put photometric observables (magnitudes, colors, etc.). The 
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Table Bl. Schcchtcr luminosity function parameters (Zucca ct al. 2006) used to derive polynomial fits to the relationships between the 
Schechter luminosity function parameters, redshift, and galaxy spectral type. The parameters in Zucca et al. (2006) were derived using 
the _B-band of the VVDS survey; here we use them to generate the r-band magnitude distributions, using the appropriate k-corrections 
by galaxy type (Blake & Bridle 2005) 



Type z-bin a - blog{h) (j>* {lO'^h^ Mpc~'^) 



1 
1 
1 
1 
1 
1 
1 


0.0 - 
0.2 - 
0.4 - 
0.6 - 
0.8 - 
1.0 - 
1.2 - 


0.2 
0.4 
0.6 
0.8 
1.0 
1.2 
1.5 


„ ^ LA 

'^•^''-0.30 
-0 04+°-2* 

-0 40+°-2° 

-n 22+° " 

„0 01+°-25 
^■'J^-0.24 

, r,q + 0.34 

^■^■^-0.34 
-1 S0+°*° 


-20.00t°|° 
-20.27tg;2^ 
-20.49+°}^ 

r,r, r,r, + 0.09 
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"•^■^-0.70 
E 1C+0.64 
O-J^O-0.64 
19+0.30 

■'•^^-o.so 

c-o+0.25 
o.dO_o 25 

9 qQ+0.22 
z.03_p 22 

9 q+0.30 
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2 
2 
2 
2 
2 
2 
2 


0.0 - 
0.2 - 
0.4 - 
0.6 - 
0.8 - 
1.0 - 
1.2 - 


0.2 
0.4 
0.6 
0.8 
1.0 
1.2 
1.5 


U.DU_Q 20 

-0 67+° " 

-0.13 

-0 ';n+°'" 

'-'■°^-0.14 

-n ^7+° i3 

-0.13 
-0 6Q+°-2° 

U.DU_Q 20 

-0 76+°-34 

'"-0.33 
, c:7+0.61 
-^■^'-0.62 


^„ „„-l-0 on 

-20.00l°-J° 
-20.13+°i? 

— 19 07+012 
!».»/ _0 j^2 

-20 39+° °9 

ZU.JM_g j^O 
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7 60^ 

' •"JU_o go 
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2 19+0-22 
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3 
3 
3 
3 
3 
3 
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0.0 - 
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0.2 
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next layers, denoted hidden layers, propagate signals until 
the output layer, -whose outputs are the desired quantities, 
in this case the photo-z estimate. Following the notation of 
Collister & Lahav (2004) , we denote a network with k layers 
and Ni nodes in the i"" layer as Ni : N2 : ... : Nk. 

A given node can be specified by the layer it belongs to 
and the position it occupies in the layer. Consider a node 
in layer i and position a with a — 1,2, Ni. This node, 
denoted Pia, receives a total input lia and fires an output 
Oia given by 



(Cl) 



where F{x) is the activation function. The photometric ob- 
servables are the inputs I\a to the first layer nodes, which 
produce outputs Oia- The outputs Oia in layer i axe prop- 
agated to nodes in the next layer (i -\- 1), denoted P(i+i)^, 
with /3 = 1,2, ..Ni+i. The total input /(i+i)/3 is a weighted 
sum of the outputs Oia 



'(i+l)/3 



Ni 



a = l 



^ial^Oia , 



(C2) 



where Wiap is the weight that connects nodes Pia and 
P(i+i)/3. Iterating the process in layer i -I- 1, signals prop- 
agate from hidden layer to hidden layer until the output 
layer. There are various choices for the activation function 
F{x) such as: a sigmoid, a hyperbolic tangent, a step func- 
tion, a linear function, etc. The choice of the activation func- 
tion typically has no important effect on the final photo-z's, 
and different activation functions can be used in different 
layers. In our implementation, we use a network configura- 
tion Nm : 15 : 15 : 15 : 1, which receives Nm magnitudes 
and outputs a photo-z. We use hyperbolic tangent activa- 
tion functions in the hidden layers and a linear activation 
function for the output layer. 
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